SemaForm: Semantic Wrapper Generation for Querying Deep Web Data Sources
نویسندگان
چکیده
A wealth of data on the World Wide Web is hidden behind web form query interfaces and cannot be found through regular search engines. Querying across multiple such sources is a tedious and error-prone process; it involves manually filling in many related, but different, web forms. SemaForm automates this process by correlating web form labels to entries in a domain ontology through the use of a continually refined knowledge base and various techniques for semantic matching.
منابع مشابه
SemaForm: Semantic Wrapper Generation for Querying Deep Web Data Sources (Interim Report)
A wealth of data on the World Wide Web is hidden behind web form query interfaces and cannot be found through regular search engines. Querying across multiple such sources is a tedious and error-prone process; it involves manually filling in many related, but different, web forms. SemaForm automates this process by correlating web form labels to entries in a domain ontology through the use of a...
متن کاملDeveloping a BIM-based Spatial Ontology for Semantic Querying of 3D Property Information
With the growing dominance of complex and multi-level urban structures, current cadastral systems, which are often developed based on 2D representations, are not capable of providing unambiguous spatial information about urban properties. Therefore, the concept of 3D cadastre is proposed to support 3D digital representation of land and properties and facilitate the communication of legal owners...
متن کاملGeneral Strategy for Querying Web Sources in a Data Federation Environment
Modern database management systems are supporting the inclusion and querying of nonrelational sources within a data federation environment via wrappers. Wrapper development for Web sources, however, is a convolution of code with extraction and query planning knowledge and becomes a daunting task. We use IBM DB2 federation engine to demonstrate the challenges of incorporating Web sources into a ...
متن کاملXML Based Semantic Data Grid Service
This paper introduces a novel wrapper-mediator based semantic data grid service mechanism to solve the problem of Semantic heterogeneity and few compatible data sources. It uses ontology based semantic information to wrap the heterogeneous data source, and employs mediator structure to supply accessing interface for the data sources, and it extends semantic query, mapping and fusion languages t...
متن کاملAn XML-enabled data extraction toolkit for web sources
The amount of useful semi-structured data on the web continues to grow at a stunning pace. Often interesting web data are not in database systems but in HTML pages, XML pages, or text files. Data in these formats are not directly usable by standard SQL-like query processing engines that support sophisticated querying and reporting beyond keyword-based retrieval. Hence, the web users or applicat...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007